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Abstract —We consider a distributed parameter estimation 
problem, in which multiple terminals send messages related to 
their local observations using limited rates to a fusion center who 
will obtain an estimate of a parameter related to observations of 
all terminals. It is well known that if the transmission rates are 
in the Slepian-Wolf region, the fusion center can fully recover 
all observations and hence can construct an estimator having the 
same performance as that of the centralized case. One natural 
question is whether Slepian-Wolf rates are necessary to achieve 
the same estimation performance as that of the centralized case. 
In this paper, we show that the answer to this question is 
negative. We establish our result by explicitly constructing an 
asymptotically minimum variance unbiased estimator (MVUE) 
that has the same performance as that of the optimal estimator 
in the centralized case while requiring information rates less than 
the conditions required in the Slepian-Wolf rate region. 

Index Terms —Distributed learning, MVUE, Slepian-Wolf 
rates, universal encoding/decoding scheme. 

1. Introduction 

There are two main different setups for statistical learning: 
centralized learning and distributed learning. In the centralized 
learning, which has been studied extensively, all data is 
available at a centralized location. In the distributed learning, 
data is stored in multiple terminals. The distributed learning 
setup has attracted significant recent research interests as the 
data involved in learning is increasingly large in volume and 
might be stored in multiple terminals Q, O. lEl, SI. For the 
distributed learning, each terminal either has a few observa¬ 
tions about all variables, or has full knowledge about a subset 
of variables (all observations about a subset of variables). The 
first scenario is relatively easier since each terminal can still 
make its own local inference without even communicating 
with each other, while communication between terminals is 
essential for the second scenario. In this paper, we focus on 
the more challenging second scenario. 

In particular, we consider a distributed parameter estimation 
problem. In the setup considered, there are two random 
variables (X, Y) with a joint probability mass function (PMF) 
Po{X,Y) parameterized by an unknown parameter 9. Two 
terminals A and B observe X'^ and Y^ respectively and send 
messages related to their own local observations with limited 
rates to terminal C, which will then obtain an estimate of the 
unknown parameter. It is well known that if the transmission 
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rates from the terminals are inside the Slepian-Wolf rate 
region a, there exists a universal coding scheme lO that 
enables terminal C to fully recover {X'^^Y'^). Hence, once 
the transmission rates are inside the Slepian-Wolf rate region, 
the performance of the best estimator for the distributed setup 
is the same as that of the best estimator for the centralized 
case. 

One natural question is: are Slepian-Wolf rates necessary 
to achieve the same estimation performance as that of the 
centralized case? The answer to this question has significant 
implications in the distributed estimation. If the answer is yes, 
then to obtain the best estimate of the unknown parameter 
requires transmission rates to be so high that they are sufficient 
to fully recover the observations at the decoder, hence no 
rate reduction is possible. On the other hand, if the answer 
is no, then the observations can be compressed beyond the 
limits of source coding for full observation recovery. At a 
first glance, the answer to this question should be no as we 
are only interested in estimating a parameter related to the 
observations and are not interested in recovering the observa¬ 
tions themselves. However, all existing related works indicate 
otherwise. For example, |[7| addressed the same question 
and suggested that Slepian-Wolf rates might be necessary. In 
addition, the performance of the best known estimator by Han 
and Amari ID does not match that of the centralized case 
when the information rates are outside of the Slepian-Wolf rate 
region. Furthermore, showed that, under certain conditions, 
extracting even one bit of information from distributed sources 
is as hard as recovering full observations and hence requires 
the information rates to be in the Slepian-Wolf rate region. 

In this paper, we show that the answer to this question is 
indeed no. We establish our result by explicitly constructing 
a distributed estimator that achieves the same performance as 
that of the optimal estimator for the centralized case while 
using information rates outside of the Slepian-Wolf region. 
In particular, we consider binary symmetric sources (i.e., 
both X'^ and Y^ are binary sequences) parameterized by 
an unknown parameter 0. In our scheme, we first design a 
universal coding/decoding scheme that enables terminal C to 
compute = X^ 0 Y^, which can be achieved using rates 
outside of the Slepian-Wolf rate region, and then construct 
an estimator using We show that our estimator is an 
asymptotically minimum variance unbiased estimator (MVUE) 
cni and achieves the same variance index as that of the 


best estimator in the centralized case. We further extend our 
scheme to a more general class of joint PMFs and show 
that our scheme can also achieve the same performance as 
that of the best estimator in the centralized case while using 
transmission rates less than the conditions required in the 
Slepian-Wolf rate region. The key idea of our scheme is, 
instead of fully recovering the source observations, we aim to 
recover sufficient statistics at terminal C using less information 
rates. 

The rest of the paper is organized as follows. We introduce 
the problem formulation in Section |I^ In Section III 


we 


establish our main results for the binary symmetric sources. 
We extend our work to a more general class of information 
sources in Section We present the simulation results in 
Section |V| Finally, we conclude the paper in Section VI 


II. Problem Formulation 


Consider two information sources X and Y taking val¬ 
ues from the discrete alphabets T' and y, respectively. 

= {{Xi.Yi)}'^^-^ are n independently and identi¬ 
cally distributed (i.i.d.) observations drawn according to the 
parametric joint PMF Po{X,Y) where G 0 is the unknown 
parameter. We consider a distributed setup in which X'^ are 
observed at terminal A and Y^ are observed at terminal B. 
Using limited rates, these two terminals send messages related 
to their own local observations to a fusion center (terminal C), 
which will then obtain an estimate 0 of 0 using these messages. 
The setup is illustrated in Fig. 


Pe{X,Y) 



Terminal B 


Fig. 1: System Model. 


In particular, terminal A employs an encoding function 
gi : X'^ gi{X'^), while terminal B employs an encoding 

function g 2 : Y'^ g 2 {Y^). The code rates are 

n n 

where ||^i|| is the cardinality of the encoding function gi. 

From gi{X'^) and g 2 {Y'^), the decoder obtains an estimate 
0 of the unknown parameter 0 using estimator 

0 = i;{g,{X^),g2{Y^)). ( 2 ) 

To evaluate the quality of the estimator, we use the variance 
index that is defined as 

V[0] = lim nVare[0] = lim nEe[{0 — E[0])‘^]. (3) 

n^oo n^oo 

It is desirable to have an estimator that is asymptotically 
unbiased, i.e., Eo[0] 0 sls n ^ oo, and has a small variance 

index. 


It is well-known that, if the coding rates satisfy (will be 
called Slepian-Wolf rates in the sequel) 


Rx > He{X\Y), ( 4 ) 

Ry > He{Y\X), ( 5 ) 

Rx + Ry > He{X,Y), ( 6 ) 

there exists universal source coding schemes ii (i.e., the 
coding scheme does not depends on the value of the unknown 
parameter 0) such that the decoder can reconstruct X'^ and 
Y'^ with a diminishing error probability. Here, He{') and 
He {'A denote the entropy and conditional entropy respec- 
tivel}Q Hence, if are satisfied, we can obtain the same 

estimation performance as that of the centralized case. 

The question we ask in this paper is: are Slepian-Wolf rates 
necessary to achieve the same estimation performance as that 
of the centralized case? [II investigated the same question 
and suggested that Slepian-Wolf rates appear to be necessary 
for achieving the centralized estimation performance. In this 
paper, we show that Slepian-Wolf rates are not necessary. 
In particular, we show that there indeed exists a class of 
PMFs and the corresponding distributed estimators that require 
communication rates less than the Slepian-Wolf rates while 
still achieving the same performance as that of the best 
estimator for the centralized case. 

Throughout the paper, we use an upper case letter Z to 
denote a random variable, and a lower case letter ^ to denote 
a realization of Z. For any sequence z'^ = • • • , z{n)) G 

the relative frequencies (empirical PMF) 7r{a\z^) = 
n{a\z^)/n,ya e Z of the components of z^ is called the 
type of z'^. Here n{a\z'^) is the total number of indices t at 
which z{t) = a. 

HI. Binary Symmetric Case 

In this section, we consider the case of binary symmetric 
sources with \X\ = \y\ = 2 and a joint PMF of {X,Y) as 
given in Table in which the unknown parameter 0 e Q = 
( 0 , 1 ). 


XjY 

0 

1 

0 

6»/2 

(l-0)/2 

1 

1 

1-1 

0/2 


TABLE I: The joint PMF of binary symmetric sources. 


We show that, to estimate 0 for this class of PMFs, we can 
achieve the centralized estimation performance using rates that 
do not satisfy 0-0- We establish this result using two steps: 
1) in the first step, we design a universal encoder at terminals 
A and B and universal decoder at terminal C to compute the 
modulo-two sum = X'^ 0 Y ^; 2) in the second step, we 
construct an estimator using Z^. 

^Throughout the paper, we use the subscript 0 to emphasize the fact that 
value of the quantity of interest depends on the parameter 6. 














A. Step 1: Comupting 


The error probability, averaging over all possible codebooks, 


Here, we discuss how to universally compute = X'^ 0 
at terminal C. Towards this goal, we will use the same 
linear code at both encoders and use a minimum entropy 
decoder at terminal C. 

Since the encoders at terminals A and B are the same, we 
use the following simplified notation 

f = gi = 92, 

R = Rx = Ry‘ (7) 

The following theorem shows that as long as i? > 
He{X\Y) = He{Y\X), the decoder can reconstruct Z^ with 
a diminishing error probability. 

Theorem 1: If 

R>He{X\Y) = He{Y\X), ( 8 ) 

there exist universal encoding/decoding functions to recon¬ 
struct Z^ = X^ 0 Y^ at terminal C with an exponentially 
decreasing error probability. 

Proof: The proof follows a similar structure as the proofs 
in m and Q. In particular, using the ideas in || 6 |, we modify 
the proof of m to make it universal. 

Random Code Generation: We use a linear code / with 
an encoding matrix A of size n x nR to map {0, to 
{1,2,..., 2^^}. Hence 11/| | = 2^^. We independently generate 
each entry of A using a uniform binary distribution, i.e., each 
entry of ^4 is 0 or 1 with probability 0.5. 

Encoding: The encoded messages of the realizations G 
{ 0 , 1 }’^ and y'^ G { 0 , are 

f{x^) = x^A, 

f{y-) = y-A. (9) 

Decoding: The decoder first combines the messages into a 

single message as 

/(x")©/( 2 /"), ( 10 ) 

in which 0 denotes the element-wise xor. 

It follows from the code linearity that 

/(^")©/(y") = /(^"©y") = /(^"). ( 11 ) 

From ^ y^), terminal C uses a minimum entropy 

decoder to obtain z'^. In particular, for each such that 
f{z^) = f{x'^ ^ y'^), the minimum entropy decoder first 
calculates the entropy of its type, then picks the one that has 
the least entropy to be the decoded sequence. In the following, 
to simplify the notation, we use Z^^^ and Z^^^ to denote 
dummy random variables whose PMFs P^(n) and P^(n) are 
the same as the types of z^ and z'^, respectively. The final 
decoded message is denoted as 

5 ” = ( 12 ) 

where f denotes the minimum entropy decoding function. 
Error Probability Analysis: A decoding error occurs if and 
only if there exists a sequence z^ 7 ^ z^ such that 

f(z^) = f(z^) and (13) 


is 


pW = ^ P^{z^)P^{z^ ^ z^). (14) 

z^e{o,i}^ 

To analyze the probability of the decoding error, let z'^ G 
{0,1}’^ denote another sequence such that 


/(5”)=/(z”). (15) 


Let Z^'^^ be a dummy random variable whose PMF P^in) 

(n) 

is the same as the type of z^. Define P^~ as the set of all 
joint types between any two sequences z'^ and z'^. For any 
given / (equivalently for a given encoding matrix A), define 
Nf{ZZ) as the number of sequences z^ such that there exists 

fn) 

another sequence z^ having the joint type Pz(n)z(n) G 
and ( p~5] ) holds. 

Since each entry in A is uniformly distributed, then each 
element in f{z^) is uniformly distributed if z^ is a nonzero 
sequence. Therefore, 


Prifiz-) = 0) = (0.5)”« 



(16) 


in which the probability is computed over all codebooks. This 
implies that 


Prifin = /(^")) = Pr{f{z^ - zn = 0 ) 



(17) 


Define as the set of all sequence pairs {z'^^z'^) 

that have the joint type Pz{n)z(n), Pp^(n) 
all sequences z'^ that have the marginal type PzM, and 
(^^) as the set of all sequences z^ that have the 
joint type P^(n) z(n) with z^. The sizes of the sets Tp and 
Tp, . , . (z'^) are bounded as ifT^ 




\TPzi., I 

(^")l 




(18) 


where e is an arbitrary small number. Notice that, for any 
given P^(n)^(ri), NJ{ZZ) is a random variable (random over 
/) that can be expressed as 


NJ{ZZ) 


and (^",5") 

= ^^z(n) 

and 5”Grp,^„,,^<„,(^”)),(19) 


where !(•) is the indication function. The expectation of 


NJ{ZZ) over all possible codebooks / is 

¥\N]{ZZ)] 

= Y. : /(5")=/(^"), 


z'^eTp 




^ E 


and5”GTp,, , 

^ Prifin = /(^")).(20) 


z'^eTp , , z^eTp , , , , (z^) 


<[T3, ([T§, and ( [2Q| imply that 

E[Nf{ZZ)] < - 


( 21 ) 


Applying the Markov’s inequality, we have 

|Z(-))+e) ^|p(n) | ^ ^ 


Pr yNf{ZZ) > 

1 


< 


i^Si+«’ 

?(”)l 


( 22 ) 


where is the total number of possible joint types and S 

is an arbitrary small number. To simplify the notation, let 

B’^(ZZ)^ -. (23) 


Considering all joint types Pz{^)z(^) simultaneously, the 
union bound and (plji imply that 


(n]{ZZ) < B^iZZ), G 




^ 1 -E 

> 0 . 


r K^i+s 


(24) 


Since the probability in ( [24| ) is positive, then there exists 
a codebook /* that the following equation holds for all joint 
types simultaneously 

Nf,{ZZ)< - \ (25) 


11/1 


As 11/* 11 = 2”-^ and < (n + 1)^, we further have 

Nf, {ZZ) (26) 

< ((„ + 1)4 + ^) 

In the following, we will focus on /*. 

Let P^^L{ZZ) denote the portion of error probability 


ej* 

associated with a fixed joint type P, 


Z(ri)z(ri) 


PipZZ) (27) 

4 ^ P,{^")l(35"^^":r(5")=r(^"), 


z'^eTp 


z(^) 


and (z^,z^) eTp , ,). 


The total decoding error probability when using /*, 

can be expressed as 


p(’^) _ 

^e,/* - 


d(’^) 

e,P 


(ZZ). 


(28) 




Let 


denote the set of marginal types Pz(.) such that 
Pz(n){z = i) — Pe{z = i)| < y for i G {0,1}, where c i is 


an arbitrarily small number. Using the definition of A^J^\ ( |28 ) 
can be rewritten as 


p(’^) _ 

^e,/* - 


A 


E Pipzz) 

+ E P^pzz) 

PzMzMPzMeAi’C> 

S 1 +S 2 , (29) 


where denotes the complimentary set of Ae"1 For S 2 , 
we have that 


Pg^’].l(ZZ) < (30) 

where D{Pz(n)\\Pe{Z)) is the KL divergence between the 
marginal type P^(n) and the true PMF Pe{Z) of Z = X 0F. 
Using Pinsker’s inequality, for Pzir.) G Ae"1 we have 

P(P2(„)||P0(Z))>2e^ (31) 


Therefore, 

S 2 < ^ 

< (nAl)"^ (32) 

( [3^ implies that 82 ^^ exponentially as n ^ 00. 

For Pi, we have that 

P^'^f\(ZZ) < Nf,{ZZ) (33) 

Using ( [26| ), we further have 

P,(}l(ZZ)< (34) 

((n + 1)4 + 5) 2 -"('C'('Pz(-*)IIA(z))+it-rf(z(”>|z("))-€)_ 

As we use the minimum entropy decoder, we have P(Z("1 < 
P(Z(")), which implies P(Z(")|Z(")) < P(Z(")) < 

P(Z(”)). Therefore, 

P^pZZ) (35) 

< ((^ + 1)4+5) 2-"(45(4"z{»)l|4"AZ))+i?-H(zW)-e)_ 

Since P^Cn) € Ae"1 it is easy to check that 

|P(Z(")) -P,(Z)| < P(P^(„)||P,(Z)) + 62 . (36) 


Here 

£2 =-y F^°S-P6'(2: = 1, (37) 

i 

which can be made arbitrarily small as ei | 0 for 0 G (0,1). 









Therefore, 


The variance index is given by 


P^^iZZ) 


(38) 


< ((n + 1)4 + (5) , 


in which 63 = e + 62 . 

This implies that Si ^ 0 exponentially as n 
R>He{Z). 


00 if 


Therefore, (39) is sufficient to guarantee that 


(n) 

e,/* 


Theorem implies that the required rates to decode 
Xn 0 yn ^ small error probability is 


Rx>H0{X\Y), 

RY>He{Y\X). 


V[ 0 c] = lim nVsiVelOc] = 0{1 — 0 ). 


(47) 


The Cramer-Rao lower bound (CRLB) of the centralized 
case 


(39) 


0 


CRLB = -1/Ee 

e{i-e) 


\n[Pg{x^,y^)] 

d^e 

= yM^c]- 


(48) 

(49) 


exponentially as n ^ (X). It is easy to check that Ho{Z) = 
H 0 {X\Y) = H 0 {Y\X). The proof is complete. 


This implies that Oc is an MVUE for the centralized case. 
Now, come back to our decentralized case. For our estimator 


(40) 

(41) we have that 


<9 = 


n(0|Z^) 


(50) 


This rate region is larger than the Slepian-Wolf region in 0 - 
0 , as the condition Rx + Ry ^ 3^) is not necessary 

anymore. 


B. Step 2: Estimation 

After obtaining which is equal to Z^ with a probability 
converging to 1 exponentially, we then design an asymptoti¬ 
cally MVUE of 0. Our estimator is 


n{0\Z^) 


(42) 


in which the notation n('|') is defined in Section]^ 

Theorem 2 : If the conditions in Theorem [T] are satisfied, the 
estimator in ( |42| ) is an asymptotically MVUE and achieves the 
optimal variance index as that of the centralized case. 

Proof: 

Consider the centralized case in which X'^ and Y'^ 
are both known perfectly. Let (^5 denote 

the joint type of the sequences and where 

(ni, 77 - 2 , ns, n 4 ) are the frequencies of occurrence of the pairs 
{(0, 0), (1,1), (0,1), (1, 0)}, respectively. The joint PMF of 
is 


/^\ (ni+n2) / -I (n3+n4) 

Consider the centralized estimator 
^ ^ {ni + nz) 


This estimator is unbiased since 

M 0 c] = e. 


(43) 


(44) 


(45) 


The variance of the estimator is calculated as 

Var0[0c] = + n2f] - 9“^ 

0{l-9) _ 

n 


(46) 


Pr{n{0\Z^) = ni + na) > 1 - , (51) 

(n) 

in which ^ is shown to converge to zero exponentially fast 
in Section UlI-AI 
Therefore 

E 0 [O] = Pr{n{ 0 \Z^) = niYn 2 )E 0 [Oc] 

+ (1 - Pr{n{0\Z^) =niP n 2 ))KY (52) 

where Ki G [0,1] is a constant. As n ^ cx), P^^f* 0 and 

hence Pr{n{0\Z^) = ni + n 2 ) ^ 1. Therefore, 

lim E 0 [O]=E 0 [§c]=O. (53) 

n^oo 

This shows that our estimator is asymptotically unbiased. 
Similarly, 

V[0] = lim nVar 0 [^] 

n^oo 

= lim {nPr{n{0\Z^) = m + n 2 ){Eg[el] - (E^^e])") 

n^oo 

+n(l - Pr{n{0\Z^) = m + n 2 ))(iT 2 - ^i)), 

where K 2 G [0,1] is a constant. As n —> 00 , —> 0 

exponentially. Therefore, 

V[0]=0{1-0) = V[0,]. (54) 

This proves that our estimator is asymptotically unbiased and 
achieves the same minimum variance that can be achieved 
even in the centralized case. Hence, our estimator is optimal. 

■ 

Combining Theorems and we conclude that, in the 
distributed parameter estimation, the Slepian-Wolf rates are not 
necessary to achieve the same optimal estimation performance 
as that of the centralized case. Fig. [^illustrates the comparison 
between the Slepian-Wolf rate region and the rate pair used in 
our estimator. 














Ry 


Therefore, as long as 


HeiX) 


He{Y\X) 


Slepian-Wolf Region 



He{X\Y) He{X) 


Rx 


Fig. 2: -k\ the rate pair required in our estimator, which is 
outside of the Slepian-Wolf rate region. 


IV. Extension 


In this section, we extend our results obtained in Section ||n| 
to a more general class of joint PMFs. Let X = y = 
{0,1,..., M — 1} and the class of PMFs be 


Pg{X = i,Y = j) 


,if + 

AT ’ Otherwise, 


(55) 


where 0 G 0 = (0, Notice that each information 

source has a uniform marginal PMF and setting M = 2 
recovers the binary case. 

Similar to the binary case, we first use a linear code and 
minimum entropy decoder to reconstruct = {X^ + 
mod M at the decoder and then design an estimator from 
In this section, we use mod M to denote element-wise mod 
operation. 

In particular, we use a linear code / that maps {0,1,..., M— 
to {0,1,..., M — 1}^. The encoded messages of the real¬ 
izations G {0,1,..., M — 1}'^ and G {0,1,..., M — 1}'^ 
are 


f{x^) = 

f{y^) = y^A, (56) 

in which the code matrix A has n rows and k columns with 
each entry taking values from — 1}. The coding 

rate is 

R=-logM. (57) 

n 

The decoder first combines the encoded messages into a 
single message as 

f{x^) + fiy^ mod M. (58) 

The final decoded message is given by 

5” = (59) 

where (j) the the minimum entropy decoding function. Follow¬ 

ing the same error probability analysis for the binary case, 
we can show that there exists a codebook /* (and hence a 
particular encoding matrix A) that achieves a probability of 
decoding error ^ 0 exponentially as n ^ cx) if 

R > He{Z) = He{X\Y) = He{Y\X). (60) 


Rx>He{X\Y), (61) 

RY>He{Y\X), (62) 


we can reconstruct = X^ + Y'^ mod M at the decoder 
with an exponentially diminishing error probability. 

After obtaining Z^, which is equal to Z^ with a probability 
converging to 1 exponentially, our estimator is 


n(M-l) ■ 

Following similar steps as those in the binary case, we can 
show that, if ([6T])-([62|) are satisfied, the estimator in is 
asymptotically unbiased and achieves a variance index 


v[e] 


6»[1 - 6>(M - 1)] 

M- 1 


(64) 


We can further show that ( [64| ) is the best variance index that 
can be achieved even in the centralized case. This implies that 
our scheme achieves the centralized performance using rates 
outside the Slepian-Wolf region. 


V. Simulation Results 

In this section, we compare our estimator to the best known 
estimator by Han and Amari lO . In the simulation, we fix the 
unknown parameter 0 and change the encoding rates Rx and 
Ry such that 


Rx=Ry = R>He{Z). (65) 

We conduct the comparison for M = 2 and M = 4 
respectively. 

For M = 2, the variance index of our estimator is (HU, 
while the variance index of the estimator by Han and Amari 
is calculated in example 3 of m 

(Var,[0])ff^ ci (66) 

where a and b are functions of Rx and Ry, whose expressions 
are given in (14.12) and (14.13) of 111, respectively. 



Fig. 3: Performance Comparison: 6> = 0.05, M = 2 























Fig. 4: Performance Comparison: ^ = 0.9, M = 2 



Fig. 5: Performance Comparison: ^ = 0.01, M = 4 


Fig. and Fig. show the performance gain, in terms of 
the variance index, of our estimator over Han and Amari’s 
estimator for binary symmetric sources (M = 2) at two 
different values of the unknown parameter, 0 = 0.05 and 
0 = 0.9, respectively. The performance difference is more 
noticeable at low rates. For 0 = 0.05, the Slepian-Wolf sum 
rate is Rx + Ry = 1.29 bits, while our estimator requires a 
sum rate of Rx + Ry = 2 R = 0.57 bits. For 9 = 0.9, the 
Slepian-Wolf sum rate is 1.47 bits, while our estimator requires 
a sum rate of 0.94 bits. Furthermore, for Han and Amari’s 
estimator to achieve the centralized performance, the required 
sum-rate is 2 bits for both cases, which is not only much larger 
than the sum rate required in our estimator but also much 
larger than the sum-rate required by conditions specified in 
the Slepian-Wolf rate region. 

For M = 4, the variance index of our estimator is given in 


(64). The performance of Han and Amari’s estimator relies on 
the choice of the test channels. The authors did not specify an 
optimal choice of the test channels in order to extend example 
3 in ID to the case of M = 4. We find the following mapping 
to be a natural extension: 


Q = 


0, if X e {0,1} 
1, if X e {2,3}, 


T = 


0, if Ye {0,1} 
1,if ye{2,3}. 


(67) 


Notice that {Q,T) are distributed according to a binary 
symmetric PMF with an unknown parameter a = 20. Using 
an estimator 6* = f leads to the following expression for the 
variance index; 


{YM^])ha - ( 68 ) 

Fig. compares the variance indices achieved using our 
estimator and Han and Amari’s estimator for M = 4 and 0 = 
0.01. It is clear that our estimator outperforms that of Han and 
Amari’s estimator. Furthermore, the performance difference is 
more noticeable at low rates. The Slepian-Wolf sum rate is 
2.24 bits, while our estimator requires a sum rate of 0.48 bits. 


VI. Conclusion 

In this paper, we have answered the question: Are Slepian- 
Wolf rates necessary to achieve the same estimation perfor¬ 
mance as that of the centralized case? We have showed that 
the answer to this question is negative by constructing an 
asymptotically MVUE for binary symmetric sources using 
rates less than the conditions required in the Slepian-Wolf rate 
region. We have also extended our work to a general class 
of information sources by modifying the encoding/decoding 
scheme and the estimation algorithm. We have further com¬ 
pared our results to the best known estimator by Han and 
Amari to show the superiority of our estimator. 
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